Enhancing Information Accessibility of Publications with Text Mining and Ontology

Authors

  • Weijia Xu
  • Amit Gupta
  • Pankaj Jaiswal
  • Crispin Taylor
  • Patti Lockhart
Abstract

We present an ongoing effort to use text mining methods and existing biological ontologies to help readers access the information contained in scientific articles. Our approach combines multiple strategies for biological entity detection with association analysis on the extracted entities. The entity extraction process uses regular expression rules, ontologies, and a keyword dictionary to obtain a comprehensive list of biological entities. In addition to extracting a list of entities, we apply natural language processing and association analysis techniques to generate inferences among entities and compare them to known relations documented in existing ontologies.

Keywords: Information systems applications; Ontology; Text Mining; Association Analysis
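As a rough illustration of the kind of pipeline the abstract describes, the following minimal sketch combines a regular expression rule with a keyword dictionary to detect entities, then counts sentence-level co-occurrence as a simple stand-in for association analysis. The pattern, the dictionary, the function names, and the sample sentences are all hypothetical and are not taken from the paper; the authors' actual rules, ontologies, and association measures are not specified here.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical rule and dictionary; neither comes from the paper.
GENE_PATTERN = re.compile(r"\b[A-Z]{2,5}\d+\b")  # matches tokens such as "FT1"
KEYWORDS = {"flowering", "root", "leaf"}          # toy keyword dictionary


def extract_entities(sentence):
    """Return the set of entities found by the regex rule or the dictionary."""
    entities = set(GENE_PATTERN.findall(sentence))
    words = re.findall(r"[a-z]+", sentence.lower())
    entities.update(w for w in words if w in KEYWORDS)
    return entities


def cooccurrence_support(sentences):
    """Count how many sentences mention each unordered entity pair."""
    pair_counts = Counter()
    for sentence in sentences:
        entities = sorted(extract_entities(sentence))
        pair_counts.update(combinations(entities, 2))
    return pair_counts


# Made-up sentences, used only to exercise the functions.
sentences = [
    "FT1 promotes flowering in long days.",
    "Loss of FT1 delays flowering.",
    "ABC2 is expressed in the root.",
]
support = cooccurrence_support(sentences)
```

Pairs with high counts in `support` would be candidates to compare against relations already recorded in an ontology; that comparison step is left out of this sketch.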

Similar resources

BioKB - Text mining and semantic technologies for the biomedical content discovery

The ever-increasing number of publicly available biomedical articles calls for automatic information extraction from digitized publications. We have implemented a pipeline which, by exploiting text mining and semantic technologies, helps researchers easily access semantic content of thousands of abstracts and full text articles from PubMed and Elsevier. The text mining component analyzes the ar...

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

Semantics-based Text Mining of Biomedical Concepts in

Searching publications for prior work on scientific concepts is central to the research process. The relevant parts of retrieved publications are typically found and evaluated manually. In the field of biomedicine, due to rapidly growing numbers of publications and the lack of standard scientific terminologies, this task is particularly challenging, complex and time consuming. Prior information...

Journal of International Scientific Publications

In recent years, several approaches have been proposed to extract information from web pages on the internet. In this research, a key technique focused on crawling and ontology is used to discover knowledge from the web. In this paper, we present an intelligent crawling system that uses patterns and ontology to extract particular information from web sites. The system was developed as an efficient tool to con...

Benchmarking ontology-based annotation tools for the Semantic Web

This paper discusses and explores the main issues for evaluating ontology-based annotation tools, a key component in text mining applications for the Semantic Web. Semantic annotation and ontology-based information extraction technologies form the cornerstone of such applications. There has been a great deal of work in the last decade on evaluating traditional information extraction (IE) systems...

Publication date: 2016